Target Selection as Variable Selection: Using the Lasso to Select Auxiliary Vectors for the Construction of Survey Weights∗

نویسندگان

  • Devin Caughey
  • Erin Hartman
چکیده

Survey nonresponse is a ubiquitous problem in modern survey research. As individuals have become less likely to respond to surveys there has been a simultaneous rise in highly granular data sources that can be used to help ameliorate the nonresponse problem. While much research has been done on post-hoc weighting methods, which provide a flexible and general solution for unit nonresponse, there is an open question of how to select the optimal auxiliary vector to include in the weighting method. We formulate this as a methodological question of variable and interaction selection where the goal is, assuming an individual level stochastic response probability, to construct an optimal set of weights for each individual respondent to account for an observed pattern of nonresponse. We use recent literature on hierarchical group-lasso regularization to determine the best auxiliary vector for weighting. We show the advantages of this method in simulations that are derived from real survey data sampled off of an individual level voter file in recent elections. We also apply the method to historic quota sampled survey data from the 1930s and 1940s to show the advantages of this method even where the sampling design is unknown. ∗For comments and assistance, we thank Adam Berinsky, Sara Chatfield, Liz Gerber, Eric Schickler, Jasjeet Sekhon, Mallory Wang, Yiqing Xu, and Teppei Yamamoto. We are thankful for the comments from the participants of the 2016 MPSA conference, 2017 Pacific Coast American Politics Conference, as well as the Washington University at St. Louis American Politics and Stanford American Politics workshops. We also gratefully acknowledge the willingness of BlueLabs to share their data with us.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selection of internal safety auditors in an Indian construction organization based on the SWARA and ARAS methods

Background: The effectiveness and adequacy of occupational health and safety management system should be monitored and evaluated at organization level on a regular basis. Safety audit has a clear role in the development of organizations safety management systems. Internal safety audit is a method to appraise to the management the current status of occupational health and safety at workplace. Se...

متن کامل

Application of genetic algorithm (GA) to select input variables in support vector machine (SVM) for analyzing the occurrence of roach, Rutilus rutilus, in streams

Support vector machine (SVM) was used to analyze the occurrence of roach in Flemish stream basins (Belgium). Several habitat and physico?chemical variables were used as inputs for the model development. The biotic variable merely consisted of abundance data which was used for predicting presence/absence of roach. Genetic algorithm (GA) was combined with SVM in order to select the most important...

متن کامل

Prioritizing Target Markets for the Export of Iran Construction Services

Abstract T ire cent decades, the internationalization of trade and companies' engagement in the global market has been found with a double significance. Planning and investment to increase the share of non-oil exports in total exports and reform its combination to export of goods and services with higher technology are of approaches of vision plan that export of Construction Services ...

متن کامل

Distributed and Cooperative Compressive Sensing Recovery Algorithm for Wireless Sensor Networks with Bi-directional Incremental Topology

Recently, the problem of compressive sensing (CS) has attracted lots of attention in the area of signal processing. So, much of the research in this field is being carried out in this issue. One of the applications where CS could be used is wireless sensor networks (WSNs). The structure of WSNs consists of many low power wireless sensors. This requires that any improved algorithm for this appli...

متن کامل

Site selection for underground dam using the water balance simulation (SWAT model) and analytical network process (ANP) (case study: Jamyshan watershed)

A large part of Iran is considered a semi-arid regions and water is a limiting factor for human activities in these areas. One of the ways to eliminate seasonal shortages of water is using groundwater. Underground dams can be used for storing and using underground water, which can be used to improve the management of existing water resources and increase the productivity of these resources.1 . ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017